Dominant block guided optimal cache size estimation to maximize IPC of embedded software

نویسندگان

Rajendra Patel

Arvind Rajawat

چکیده

Embedded system software is highly constrained from performance, memory footprint, energy consumption and implementing cost view point. It is always desirable to obtain better Instructions per Cycle (IPC). Instruction cache has major contribution in improving IPC. Cache memories are realized on the same chip where the processor is running. This considerably increases the system cost as well. Hence, it is required to maintain a trade-off between cache sizes and performance improvement offered. Determining the number of cache lines and size of cache line are important parameters for cache designing. The design space for cache is quite large. It is time taking to execute the given application with different cache sizes on an instruction set simulator (ISS) to figure out the optimal cache size. In this paper, a technique is proposed to identify a number of cache lines and cache line size for the L1 instruction cache that will offer best or nearly best IPC. Cache size is derived, at a higher abstraction level, from basic block analysis in the Low Level Virtual Machine (LLVM) environment. The cache size estimated from the LLVM environment is cross validated by simulating the set of benchmark applications with different cache sizes in SimpleScalar’s outof-order simulator. The proposed method seems to be superior in terms of estimation accuracy and/or estimation time as compared to the existing methods for estimation of optimal cache size parameters (cache line size, number of cache lines).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Code Size Efficiency in Global Scheduling for VLIW/EPIC Style Embedded Processors

In embedded computing, code size is very important for system cost and performance. In global scheduling for VLIW/EPIC style embedded processors, region-enlarging optimizations, especially tail duplication, are commonly used to exploit instruction level parallelism (ILP) to boost the performance. The code size increase due to such optimizations, however, raises serious concerns about the affect...

متن کامل

A Hybrid Fuzzy MCDM Approach to Determine an Optimal Block Size in Open-Pit Mine Modeling: a Case Study

The computer-based 3D modeling of ore bodies is one of the most important steps in the resource estimation, grade determination, and production scheduling of open-pit mines. In the modeling phase, the volume of the orebody model is required to be filled by the blocks and sub-blocks. The determination of Block Size (BS) is important due to the dependence of the geostatistical issues and calculat...

متن کامل

Contents II Cache Awareness in Blocking Techniques 76 8

To date, data locality optimizing algorithms mostly aim at providing e cient strategies for blocking and reordering loops. But little research has been devoted to the nal step, i.e., computing the optimal block size. Optimal block sizes are currently computed as if a cache behaves as a local memory, i.e., cache interference phenomena are ignored. Case-studies have already shown that cache inter...

متن کامل

Refining the Utility Metric for Utility-Based Cache Partitioning

It is expected that future high-performance processors will implement large L2 or L3 caches that will be shared by multiple cores. Allocating shared cache space among multiple programs is an important problem and has received much attention in recent years. Typically, the ways or sets of a cache are partitioned across multiple competing programs. Most recent work is built on the concept of marg...

متن کامل

The Effect of Nanometer-Scale Technologies on the Cache Size Selection for Low Energy Embedded Systems

Several studies have shown that cache memories account for more than 40% of the total energy consumed in processor-based embedded systems. In microscale technology nodes, active power is the primary contributor to the total power dissipation of a CMOS chip. However, in nano-scale technologies, leakage power is dominating total power dissipation. In this paper, we show that this fact affects the...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1312.2306 شماره

صفحات -

تاریخ انتشار 2013

Dominant block guided optimal cache size estimation to maximize IPC of embedded software

نویسندگان

چکیده

منابع مشابه

Code Size Efficiency in Global Scheduling for VLIW/EPIC Style Embedded Processors

A Hybrid Fuzzy MCDM Approach to Determine an Optimal Block Size in Open-Pit Mine Modeling: a Case Study

Contents II Cache Awareness in Blocking Techniques 76 8

Refining the Utility Metric for Utility-Based Cache Partitioning

The Effect of Nanometer-Scale Technologies on the Cache Size Selection for Low Energy Embedded Systems

عنوان ژورنال:

اشتراک گذاری